Building a Rich Large-scale Lexical Base for Generation

نویسنده

  • Hongyan Jing
چکیده

Most large lexical resources have been developed with language interpretation in mind and can not be used directly for generation. We present a rich large-scale lexical base for generation, constructed by merging various linguistic resources. Our approach meets the needs of language generation systems by providing the facilities for mapping from semantic concepts to verb/sense pairs, for identifying the valid subcategorization forms for a given verb sense, and for representing alternations for paraphrasing power. Information from diierent resources enriches and constrains each other, so the nal result is complete as well as accurate. We show by example how this lexical base can be intergrated into a generation package and how it simpliies development process while improving system performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining Multiple, Large-Scale Resources in a Reusable Lexicon for Natural Language Generation

A lexicon is an essential component in a generation system but few efforts have been made to build a rich, large-scale lexicon and make it reusable for different generation applications. In this paper, we describe our work to build such a lexicon by combining multiple, heterogeneous linguistic resources which have been developed for other purposes. Novel transformation and integration of resour...

متن کامل

Controlling The Application Of Lexical Rules

In this paper, we describe an item-familiarity account of the semi-productivity of morphological and lexical rules, and illustrate how it can be applied to practical issues which arise when building large scale lexical knowledge bases which utilize lexical rules. Our approach assumes that attested uses of derived words and senses are explicitly recorded, but that productive use of lexical rules...

متن کامل

Integrating a Large-Scale, Reusable Lexicon with a Natural Language Generator

This paper presents the integration of a largescale, reusable lexicon for generation with the FUF/SURGE unification-based syntactic realizer. The lexicon was combined from multiple existing resources in a semi-automatic process. The integration is a multi-step unification process. This integration allows the reuse of lexical, syntactic, and semantic knowledge encoded in the lexicon in the devel...

متن کامل

Combining Dictionary-Based and Example-Based Methods for Natural Language Analysis

We propose combining dictionary-based and example-based natural language (NL) processing techniques in a framework that we believe will provide substantive enhancements to NL analysis systems. The centerpiece of this framework is a relatively large-scale lexical knowledge base that we have constructed automatically from an online version of Longman's Dictionary of Contemporary English (LDOCE), ...

متن کامل

Robust Natural Language Generation from Large-Scale Knowledge Bases

We have begun to see the emergence of large-scale knowledge bases that house tens of thousands of facts encoded in expressive representational languages. The richness of these representations o er the promise of signi cantly improving the quality of natural language generation, but their representational complexity, scale, and task-independence pose great challenges to generators. We have desig...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997